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Abstract: As part of an ongoing effort to develop genetically encoded calcium ion (Ca ) 
indicators we recently described a new variant, designated CH-GEC02.1, that is a genetic 
chimera of the red fluorescent protein (FP) mCherry, calmodulin (CaM), and a peptide that 
binds to Ca 2+ -bound CaM. In contrast to the closely related Ca 2+ indicator R-GECOl, 
CH-GEC02.1 is characterized by a much higher affinity for Ca 2+ and a sensing mechanism 
that does not involve direct modulation of the chromophore pK a . To probe the structural 
basis underlying the differences between CH-GEC02.1 and R-GECOl, and to gain a better 
understanding of the mechanism of CH-GEC02.1, we have constructed, purified, and 
characterized a large number of variants with strategic amino acid substitutions. This effort 
led us to identify Gin 163 as the key residue involved in the conformational change that 
transduces the Ca 2+ binding event into a change in the chromophore environment. In 
addition, we demonstrate that many of the substitutions that differentiate CH-GEC02.1 
and R-GECOl have little influence on both the Kd for Ca + and the sensing mechanism, 
and that the interdomain linkers and interfaces play important roles. 

Keywords: red fluorescent protein; calcium ion; genetically encoded sensor; mutagenesis; 
spectroscopy 
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1. Introduction 

2+ 

Molecular sensors that enable non-invasive fluorescence imaging of intracellular Ca dynamics 
with high spatial and temporal resolution are powerful tools in modern cell biology and neuroscience 

2+ 2+ 

research. As the Ca ion is a universal "second messenger", detection of elevated Ca levels can 
reveal when an intracellular signaling pathway is being activated [1], or when an excitable cell is 
experiencing an action potential [2]. While organic dye-based Ca 2+ indicators have long been a 
mainstay of such research [3], over the last decade proteinaceous Ca 2+ indicators have emerged as a 
preferred alternative for many applications. The major advantage of proteinaceous Ca 2+ indicators is 
that they are genetically encodable and thus can be tissue- selectively expressed and imaged in 
transgenic model organisms [4] . 

The development of proteinaceous Ca + indicators became a possibility only after the discovery, 
cloning, and subsequent optimization of the Aequorea victoria green FP [5]. Due to its inherent ability 
to generate a chromophore through an autonomous series of post-translational modifications, the green 
FP uniquely provides a means of genetically encoding a fluorophore. The first FP-derived Ca 2+ 

2+ 

indicators were based on the Ca -dependent modulation of Forster Resonance Energy Transfer 
(FRET) efficiency between two FP variants with different hues [1]. Subsequent protein engineering 

2+ 

efforts led to the development FP-derived Ca indicators based on a single FP that exhibited an 

2+ 

intensiometric response of Ca [6,7]. Such single FP-derived indicators are composed of a circularly 
permuted FP variant with new termini in close proximity to the centrally located chromophore. Fused 

2+ 

to one of these new termini is the Ca -binding protein CaM, and fused to the other termini is the Ml 3 

2+ 

peptide that binds to Ca -bound CaM (Figure 1A). The generally accepted mechanism of such sensors 
is that the chromophore is exposed to the solvent and the fluorescence is largely quenched in the 
absence of Ca 2+ . Binding of Ca 2+ causes an interaction between CaM and its peptide binding partner 
that stabilizes the chromophore in a conformation and environment where fluorescent brightness is 
increased due to higher quantum yield and/or extinction coefficient. Such a stabilizing interaction is 
apparent in the X-ray crystal structure of R-GECOl, where a lysine side chain from a neighboring 
P- strand is stabilizing the fluorescent phenolate form of the chromophore in the Ca 2+ -bound state 
(Figure IB) [8]. Similarly, in GCaMP2 [9,10], an arginine side chain from CaM bridges the CaM to FP 
interface and stabilize the phenolate form of the chromophore in the Ca + -bound state (Figure 1C). 

Since 2001 when the single FP Ca 2+ indicator design was first reported [6,7], dedicated optimization 
efforts have produced an ever-improving series of improved GCaMP variants [2,11-13]. Recent 
years have also seen the introduction of an expanded color palette of Ca 2+ indicators based on both 
engineered versions of Aequorea green FP [8,14] as well as homologous red fluorescent Anthozoan 
FPs from organisms such as Entacmaea quadricolor sea anemone [8] and Discosoma sp. coral [14,15]. 
The prototypical red fluorescent indicator is R-GECOl [14] that was engineered from the 
Discosoma-denwed FP known as mApple [16]. Some of the most recent additions to this growing 
selection of red fluorescent indicators are CH-GECO2.0 and CH-GEC02.1 [17], which were 
engineered from the popular Discosoma-derived red FP known as mCherry [18]. 
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Figure 1. The structure of a single FP-based Ca" + indicator. (A) Cartoon representation of 
the X-ray crystal structure of R-GECOl [8]. Domains and linkers are colored according to 
the sequence alignment shown in Figure 2. The "cp linker" is the linker used to connect the 
original N-and C-termini of the circularly permuted (cp) FP. (B) Zoom in on the circular 
permutation site of R-GECOl, where the chromophore is exposed to the interface between 
the CaM and FP domains. (C) The circular permutation site of GCaMP2 (PDB ID 3EVR) 
represented from a similar perspective to (B) [9]. 




Although they are both ultimately derived from the same wild-type protein, the extensive processes 
of directed evolution that led to R-GECOl and CH-GEC02.1 have introduced a considerable number 
of differences in the amino acid sequences (Figure 2). These differences include 26 amino acid 
substitutions (LeulMet, Glu6Val, Ile7Phe, Ala40Gly, Phe41Thr, Phe83Trp, Ilel04Val, Ilel05Val, 
Hisl06Thr, Asnl08Thr, Prol31Ser, Aspl32Asn, Glul44Leu, Serl47Thr, Metl50Leu, Serl59Gly, 
Lysl63Gln, Glyl64Arg, Argl66Lys, Glyl91Asp, Cys214Tyr, Asp21Gly (CaM), Asp23Ala (CaM), 
Phe61Leu (CaM), Thr77Ser (CaM), and Aspl09Asn (CaM), relative to R-GECOl), as well as 
differences in the interdomain linkers. In addition, they exhibit some quite dramatic differences in their 
respective K d s for binding to Ca 2+ (6 nM for CH-GEC02.1 vs. 480 nM for R-GECOl) and even in the 
underlying mechanism by which this sensing occurs [17]. Specifically, R-GECOl operates on the 
basis of Ca + -dependent shift in the chromophore pK a [14], whereas the Ca 2+ sensing mechanism of 
CH-GEC02.1 appears to depend on the interaction of the chromophore with a yet unidentified 
ionizable amino acid side chain with a pK a of -6.5 to 7.0 [17]. 
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Figure 2. Sequence alignment of Ca 2+ indicators discussed in this work. Substitutions 
relative to CH-GEC02. 1 are highlighted with white text on a black background. Adapted 
in part from Carlson and Campbell [17]. 
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In an effort to identify which amino acid substitutions are responsible for the differences between 
CH-GEC02.1 and R-GECO 1, and to obtain insight into the mechanism of CH-GEC02.1, we now 
report the characterization of a barrage of single-site mutants of CH-GEC02.1. Each variant has been 
characterized in terms of its Ca 2+ Kd as well as its fluorescence intensity as a function of pH both in the 
presence and absence of Ca 2+ . Interpreting the results of these studies in the context of the mCherry 
and R-GECO 1 crystal structures has allowed us to propose a mechanistic basis for the response of 
CH-GEC02.1 to Ca 2+ binding. 



2. Materials and Methods 

2.1. Mutagenesis 

All molecular biology procedures were carried out using genes encoding either CH-GECO2.0, 
CH-GEC02.1, or R-GECOl in pBAD/His B, as previously described [17]. All site-directed mutagenesis 
was performed using the Quikchange lightning mutagenesis kit (Agilent, Santa Clara, CA, USA) and 
primers designed according to the manufacturers guidelines. 
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2.2. Plasmid Purification 

All plasmid DNA was purified from bacteria using a chloroform extraction protocol. Briefly, 
150 uL of solution I (50 mM Tris, 10 mM EDTA, 100 ug/mL RNaseA, pH 8.0) was used to resuspend 
the bacterial pellet. Then 150 uL of solution II (1% SDS, 0.2 M NaOH) was added and the mixture is 
gently inverted several times. Solution III (2 M acetic acid, 3 M KOAc, pH 5.5) was added to a total 
volume of 450 uL and mixed to pellet the non-soluble cell debris. Finally, 150 uL of chloroform was 
added and mixed several times before being centrifuged at 14,000 rpm, 4 °C for 5 min. The top 
aqueous layer was separated, mixed with 800 uL of 100% ethanol, and then centrifuged at 4 °C for 
5 min. The DNA pellet was washed with 500 uL of 70% ethanol, allowed to air-dry, and then 
dissolved in distilled water. Coding sequences of all gene variants were sequenced using BigDye 
Terminator v3.1 Cycle Sequencing Kit (Life Technologies, Carlsbad, CA, USA) and reactions were 
analyzed by the University of Alberta Molecular Biology Services Unit. 

2.3. Protein Expression and Purification 

Escherichia coli strain DH10B was transformed with the plasmid of interest by electroporation. 
Transformed bacteria were grown overnight on solid media containing ampicillin, and then a single 
colony was picked and grown overnight in 5 mL Luria broth (LB) supplemented with ampicillin at 
37 °C. The 5 mL culture was then used to inoculate 250 mL of Terrific broth (TB) and grown to an 
optical density of 0.6. Protein expression was induced with 0.004% arabinose and the culture was 
grown overnight at 37 °C or for two nights at 30 °C, depending on the brightness of the protein 
construct. Bacteria were pelleted at 10,000 rpm, 4 °C for 10 min and the pellet was then resuspended 
in 10 mM Tris-Cl, 150 mM NaCl pH 7.4 at 4 °C. Cells were lysed using a cell disruptor (Constant 
Systems Ltd., Daventry, United Kingdom) and the debris pelleted at 14,000 rpm. Protein was purified 
from the supernatant by Ni-NTA affinity chromatography (Qiagen, Hilden, Germany) according to the 
manufactures instructions. Briefly, Ni-NTA beads were collected on a column with a vacuum manifold 
and washed twice with 10 mM Tris-Cl, 30 mM imidazole, 150 mM NaCl pH 8.0. The beads were 
gravity washed once and then eluted with 300 mM imidazole, 10 mM Tris-Cl pH 8.0. The excess 
imidazole was removed via buffer exchange with Amicon columns (MWCO 10,000) and 10 mM Tris-Cl, 
150 mM NaCl, pH 7.3. 

2.4. pH Titrations 

Fluorescence intensity as a function of pH was determined by dispensing 5 uL of the protein 
solution into 50 uL of the desired pH buffer in triplicate into a 396-well clear-bottomed plate (Thermo 
Fisher Scientific, Waltham, MA, USA). Buffer solutions were prepared by adjusting the pH of a 
solution of 30 mM trisodium citrate and 30 mM sodium borate to pH 1 1.5. The pH of the solution was 
then adjusted with HC1 (12 M and 1 M) and 10-15 mL was collected at pH value intervals of 0.5, plus 
two additional solutions at pH 5.25 and 5.75. Buffers were prepared both without (30 mM MOPS, 
100 mM KC1, 10 mM EGTA, pH 7.2) and with (30 mM MOPS, 100 mM KC1, 10 mM CaEGTA, 
pH 7.2) Ca + . Fluorescence emission for each solution was recorded using a Tecan (Maennedorf, 
Switzerland) Safire2 microplate reader. 
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2.5. Ca Titrations 

2+ 

The apparent K d for Ca response was determined by mixing the protein solution with buffers 

2+ 

containing various amount of Ca , prepared as described in the Calcium Calibration Buffer Kit from 
Life Technologies (Carlsbad, CA, USA). The Ca 2+ -free buffer (30 mM MOPS, 100 mM KC1, 10 mM 
EGTA, pH 7.2) and Ca 2+ -saturated buffer (30 mM MOPS, 100 mM KC1, 10 mM CaEGTA, pH 7.2) 
were mixed in different ratios to generate buffers with Ca 2+ concentrations ranging from zero Ca 2+ to 
39 uM Ca 2+ . Similar to the pH titrations, 5-10 uL of the protein was mixed with 150-200 uL of each 
Ca 2+ buffer. 50 uL of each solution was aliquotted in triplicate into a 3 96 -well plate and the 
fluorescence emission was recorded using the plate reader. Emission peaks were integrated and plotted 
against the log of the calculated free Ca 2+ concentration. Ca + titration curves were fit with a sigmoidal 
curve in order to obtain the K d and the Hill coefficient. 

3. Results and Discussion 

3.1. Probing the Determinants of the Ca 2+ -Binding Affinity 

CH-GECO2.0 (K d = 28 nM) and CH-GEC02.1 (K d = 6 nM) [17] differ by only 3 substitutions in 
the CaM domain (Gly21Asp, Leu61Phe, and Ser77Thr, relative to CH-GEC02.1) and 2 substitutions 
in the FP domain (Thrl47Ile and Aspl91Gly) (Figures 2 and 3A). However the K d s for Ca of these 
two proteins differ by a factor of 4.7. Furthermore, CH-GECO2.0 and R-GECOl (K d = 480 nM) [14] 
differ by only 1 substitution in the CaM domain (Ala23Asp, relative to CH-GECO2.0), but have K d s 
that differ by a factor of 17. The M13 peptide domain of all three proteins is identical. To determine 
which individual mutations, or combination of mutations, were responsible for the differences in K d 
values, we used site directed mutagenesis to systematically revert mutations and then determined the 
effect on Ca affinity. Notably, in wild-type CaM, Asp21 and Asp23 are two of the key Ca chelating 
residues of the first EF hand [19]. 

We first introduced mutations to revert the CaM sequence of CH-GEC02.1 back to CH-GECO2.0. 
Results for all mutations discussed in this work are summarized in Supplementary Table 1. We 
determined that CH-GEC02.1 with Gly21Asp or Ser77Thr gave K d values of 7 nM and 8 nM 
respectively (Figure 3B). We were unable to purify any soluble protein for CH-GEC02.1 Leu61Phe, 
which is located near the second EF hand of CaM. Previous studies of CaM have demonstrated that 
mutations in the second EF hand are more detrimental to the Ca 2+ affinity than mutations in the first 
EF hand [20]. The combination of Gly21Asp and Ser77Thr results in a K d of 13 nM, which is 
somewhat higher than either substitution alone. Addition of the Leu61Phe mutation yielded a K d of 
44 nM for the triple mutant (Figure 3C). Overall, this result demonstrates that these three mutations in 
the CaM domain account for most of the difference between the K d values of CH-GECO2.0 and 
CH-GEC02.1. The fact that the K d values of CH-GECO2.0 (28 nM) and the triple mutant of 
CH-GEC02.1 (44 nM) are not identical indicates that the two substitutions in the FP domain 
(Ilel47Thr and Glyl91Asp) must also have a subtle influence on the K d , likely through interactions at 
the interface between CaM and the FP domain. 
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Figure 3. Probing the differences in Ca 2+ affinity between CH-GECO2.0, CH-GEC02.1, 
and R-GECOl. (A) Location of amino acid substitutions that differentiate the indicators, 
represented using the R-GECOl crystal structure [8]. Mutations are labeled relative to 
CH-GEC02.1. (B) Ca 2+ titration curves for variants with single mutations that revert 
CH-GEC02.1 CaM to CH-GECO2.0 CaM. (C) Ca 2+ titration curves for variants with 
multiple mutations that revert the CaM domain of CH-GEC02.1 to that of CH-GECO2.0. 

(D) Ca 2+ titrations for two of the point mutants that revert CH-GEC02.1 to R-GECOl. 

2+ 

(E) Ca titrations for CH-GEC02.1 variants with the CaM domain partially and completely 
converted to the R-GECOl CaM domain. 
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We next introduced mutations intended to probe the differences between the CaM domain of 
R-GECOl (K d = 480 nM) and CH-GEC02.1 (K d = 6 nM) variants, which have almost a 2-order of 

2~T" 

magnitude difference in their ^s for Ca . These two proteins differ by a total of 5 substitutions in the 
CaM domain (Gly21Asp, Ala23Asp, Leu61Phe, SerWThr, and Asnl09Asp, relative to CH-GEC02.1) 
(Figures 2 and 3A). Introduction of the single mutations Ala23Asp or Asnl09Asp into CH-GEC02.1 
gave essentially unchanged K&S of 7 nM and 13 nM, respectively (Figure 3D). To fully convert the 
CH-GEC02.1 CaM to that of R-GECOl CaM we proceeded to introduce additional substitutions. As 
discussed above, CH-GEC02.1 with Gly21Asp, Leu61Phe and Ser77Thr had an increased K d of 
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44 nM. The addition of Ala23Asp slightly lowered the Kd to 23 nM. Further reversion of a mutation 
close to the FP-CaM interface (Aspl91Gly) and one final mutation in CaM that is relatively distant 
from the EF hands (Asnl09Asp) produced a protein with a Kd of 35 nM (Figure 3E). These results 
clearly demonstrate that the dramatic difference in K d between R-GECOl and CH-GEC02.1 cannot be 
fully explained by the mutations within the CaM domain. Rather, these changes must be due to the 
interactions between the CaM and FP domains, either via the linkers or the domain interface. 

3.2. Investigating the Mechanism of CH-GECO Variants 

To gain a better understanding of the differences in magnitude and mechanisms of the fluorescent 
responses of CH-GECO2.0, CH-GEC02.1, and R-GECOl, we created, characterized, and compared a 
series of protein variants with strategic substitutions at key residues of interest. 

3.2.1. CH-GECO2.0 vs. CH-GEC02.1 

Just two mutations in the FP domain (Ilel47Thr and Glyl91Asp, relative to CH-GECO2.0) 
differentiate CH-GECO2.0 and CH-GEC02.1, but their fluorescent responses to Ca 2+ are 150% and 
250%, respectively. While the Aspl91Gly reversion variant of CH-GEC02.1 variant retained a similar 

2+ 

response to CH-GEC02.1, reversion of position 147 to He resulted in a decrease in the Ca response 
to 115%. This result indicates that the Ilel47Thr mutation is primarily responsible for the improved 
performance of CH-GEC02.1, and is consistent with the fact that the side chain of residue 147 is in 
close proximity to the phenolate moiety of the chromophore (Figure IB). 

3.2.2. R-GECOl vs. CH-GEC02.1: The Influence of the Linkers 

Previous work in our lab with both single FP-based and FRET-based indicators has showed that the 
length and composition of the linkers between the protein domains can have a substantial effect on the 
response of the indicator to the analyte of interest [14,21,22]. The sequence alignment between 
R-GECOl and CH-GEC02.1 (Figure 2) reveals that the linkers between the three domains are 
significantly different. Specifically, R-GECOl has a shorter linker between M13 and the FP 
(Pro-Val-Val) than does CH-GEC02.1 (Leu-Glu-Ser-Leu) and the first two residues (Leu 145 and 
Serl46) of the FP domain are deleted. R-GECOl also has a shorter linker between CaM and the FP 
domain (Ala-Thr-Arg) than does CH-GEC02.1 (Gly-Gly-Thr-Arg). The Thr-Arg residues of both 
linkers correspond to the translation of a Mlul restriction site. 

Replacing the Leu-Glu-Ser-Leu linker and residues 145 and 146 of CH-GEC02.1 with Pro-Val-Val 
resulted in a dim red fluorescent protein (designated CH-GEC02.1-PVV). UV-vis spectroscopy 
revealed that a large portion of the protein was halted at the blue intermediate species that absorbs at 
382 nm (Supplementary Figure 1A) [23]. As shown in Supplementary Figure 1C, the pH dependence 
of the red fluorescent component of CH-GEC02.1-PVV was quite different from that of CH-GEC02.1 
(Figure 4A), and the response to Ca + was effectively abolished. Replacing the FP to CaM 
Gly-Gly-Thr-Arg linker of CH-GEC02.1 with Ala-Thr-Arg and introducing the adjacent Leul44Asp 
mutation, fully converted the CH-GEC02.1 into a variant with identical linkers to R-GECOl (designated 
CH-GEC02.1-PVV-ATA). However, no red fluorescent protein could be recovered from repeated 
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attempts to express this protein. These results indicate that, in addition to their critical role in the Ca 
response mechanism of single FP indicators, the interdomain linkers can also have an important role in 
the chromophore maturation. 



Figure 4. Representative fluorescence vs. pH titration curves for CH-GEC02.1 variants. 
(A) CH-GEC02.1 titration in the presence and absence of Ca 2+ , adapted from Carlson and 
Campbell [17]. (B-I) Titration curves for additional variants discussed in the text, both in 
the presence and absence of Ca 2+ . 
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3.2.3. Attempts to Identify Key Residues Involved in the Ca 2+ Response of CH-GEC02.1 



From the CH-GEC02.1 pH titration curve reproduced in Figure 4A, it is apparent that there is a key 
ionization with a pK a around pH 6.5 to 7.0. We hypothesized that, at physiological pH and in the 
absence of Ca 2+ , there is a specific residue (of unknown identity) for which the deprotonated form is 
quenching the fluorescence of the chromophore [17]. In the presence of Ca 2+ either this residue is no 
longer deprotonated due to a change in pK a , or it is no longer in the right conformation to quench 
the chromophore fluorescence. In an attempt to determine the identity of the residue that is so 
critical to the function of the CH-GECO variants, we performed mutagenesis of rationally selected 
candidate residues. 

As the pKn in question is around 6.5 to 7.0, the most likely candidate residue was histidine. 
However, the X-ray crystal structure of mCherry [24] reveals no histidines in the immediate vicinity of 
the chromophore, so we looked to histidine residues that could reasonably be interacting with the 
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chromophore by an extended hydrogen bond network. One likely candidate was His75, which appears 
to participate in a hydrogen bond network with Tyrl93, Lys70, Glul48, several water molecules, and 
the chromophore (Supplementary Figure 2A) in the mCherry crystal structure [24]. We mutated each 
of these residues individually and characterized the pH profile and K d for the variants that formed 
the red fluorescent chromophore. No red fluorescent protein could be obtained for Lys70Gln and 
Glul48Gln, and absorbance scans of both variants revealed that the chromophore maturation process 
was stuck at the blue intermediate species similar to CH-GEC02.1-PVV described above [23]. 
Mutation His75Gln gave a pH profile that was very similar to CH-GEC02.1, albeit with a reduced 
response of 158% (Figure 4B). The Tyrl93Phe mutation yielded similar results with a pH curve 

2+ 

(Figure 4C) and Ca affinity that are similar to CH-GEC02.1. We conclude that this hydrogen bond 

2+ 

network is not involved in the Ca -dependent response of the CH-GECO variants, but does play an 
important role in the chromophore maturation process. 

Having ruled out our original hydrogen-bond network hypothesis, we reexamined the mCherry 
crystal structure and identified Tyr214, His204, Hisl72, and Aspl of CaM as ionizable functional 
group in reasonably close proximity to the chromophore (Supplementary Figure 2B). Each of these 
residues was mutated to non-ionizable alternatives in the context of CH-GEC02.1, and the pH profiles 
both with and without Ca 2+ were determined (Figure 4F-I). The pH profiles of all four mutants still 
retained the characteristic fluorescence increase at pH 6.5 to 7.0, indicating that none of these residues 
played a part in the Ca 2+ sensing mechanism. Notably, Aspl is the first amino acid of CaM and 
mutation to Gin did not alter the pH dependence, but it did increase the Kd to 53 nM and the 
fluorescence response to 307%. Due to its improved fluorescence response and a K d that is very similar 
to that of the high affinity Ca 2+ indictor YC-Nano50 [25], CH-GEC02.1 Aspl Gin may be particularly 

2+ 

useful for imaging of transient changes in Ca concentration in cells with low resting concentrations. 

Expanding our efforts to include non-ionizable residues close to the chromophore, we decided to 
target positions 159, 163, and 166, which differ between R-GECOl and CH-GEC02.1 (Supplementary 
Figure 2B). Introducing the Glyl59Ser (Figure 4D) or Lysl66Arg (Figure 4E) substitutions into 
CH-GEC02.1 resulted in variants where no substantial changes were observed in the Kd or excitation 

2+ 

and emission characteristics, and only modest effect on the Ca response in the case of the Glyl59Ser 
variant (Supplementary Table 1). In contrast, the Glnl63Lys substitution completely abolished the 
fluorescent response of CH-GEC02.1. This variant will be discussed in more detail in a following 
section. From these results it is apparent that, of all the residues targeted in these efforts, only Gin 163 
plays a critical role in the mechanism of action of CH-GEC02.1. 

3.2.4. The Influence of Residue Glnl63 on the Ca 2+ Response of CH-GEC02.1 

Residue 163 of DsRed [26], and its engineered descendants such as mRFPl [27] and mCherry [18], 
is in close proximity to the chromophore and thus substitutions at this position can have profound 
effects on the fluorescent characteristics. In wild-type DsRed, residue 163 is a Lys, which stabilizes the 
phenolate group of the chromophore in the anionic state by interaction with the protonated terminal 
amine [26]. During the directed evolution of dimeric and monomeric DsRed variants, favorable 
substitutions at this position included Lysl63Gln in the dimer2 variant and Lysl63Met in mRFPl [27]. 
During later directed evolution of mRFPl that ultimately led to the mCherry variant, the Metl63Gln 
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mutation was rediscovered and retained since this mutation provided improved chromophore maturation 
efficiency [18]. In contrast, the Metl63Lys reversion was found to be beneficial during the directed 
evolution that led to the mApple variant [16]. Through the directed evolution process that led to 
CH-GEC02.1, the Lysl63Gln substitution inherited from mCherry was retained. Notably, this residue 
is found in close proximity to both the chromophore and the circular permutation site, so it is prime 
candidate for a residue that would play a crucial role in the Ca 2+ sensing mechanism. 

The X-ray crystal structures of R-GECOl and RCaMP (engineered from Entacmaea quadricolor-derived 
mRuby [28]) have recently been reported [8]. In the R-GECOl structure, Lysl63 is interacting directly 
with the phenolate of the chromophore in the Ca 2+ bound state (Figure IB), which indicates that 
this residue almost certainly has a critical role in the Ca 2+ response, likely lowering the pK^ of the 
chromophore in the presence of Ca 2+ by stabilizing the negative charge on the phenolate. The structurally 
equivalent residue in RCaMP [8] is a methionine that is directed inside the barrel in a conformation 
that is very similar to that of Lysl63Gln in mCherry [24]. 

As mentioned in the previous section, introducing the Glnl63Lys substitution into CH-GEC02.1 
completely abolished the response of CH-GEC02. 1 and had a pronounced effect the protonation state 
of the chromophore as evidenced by the pH titration curve shown in Figure 5A. The absorbance, 
excitation, and emission wavelengths remain unchanged. To determine if the combination of Glnl63Lys 
and R-GECOl -type linkers would be sufficient for rescuing the Ca 2+ responsiveness, the Glnl63Lys 
substitution was introduced into CH-GEC02.1-PVV-ATA. The resulting variant formed the red 
fluorescent chromophore but did not show a significant fluorescence response to Ca 2+ (Supplementary 
Figure 1B,C). 

Figure 5. Effect of mutations at Gin 163. (A) The Glnl63Lys and Gin 163 Asp mutations 
abolish the response to Ca 2+ . (B) The Glnl63Met mutant retains a response to Ca 2+ . (C) In 
the X-ray structure of mCherry (PDB ID 2H5Q) [24], Gin 163 is directed into the interior 
of the protein in close proximity to the chromophore. Compare the orientation of Gin 163 
with the orientation of Lysl63 in R-GECOl as shown in Figure IB. 




pH pH 



To further probe the role of Gin 163, we created the Glnl63Met and Gin 163 Asp variants of 
CH-GEC02.1. Intriguingly, the Glnl63Met variant behaved very similar to CH-GEC02.1 in terms of 
its response to Ca + , though the Ca + response decreased to 54% and the K& decreases to 2 nM 
(Figure 5B). Much like the Glnl63Lys variant, the Gin 163 Asp variant exhibited no response to 
Ca 2+ (Figure 5A). Based on these results, we concluded that Glnl63 plays a critical role in the 
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CH-GEC02.1 sensing mechanism, and that this role is distinct from that played by the equivalent 
Lysl63 residue in R-GECOl. 

3.2.5. A Proposed Ca 2+ Sensing Mechanism for CH-GEC02.1 

The combined use of site directed mutagenesis and detailed in vitro characterization has enabled us 
to identify Gin 163 as one of the key residues in the Ca + sensing mechanism of CH-GEC02.1. In the 
absence of a crystal structure of CH-GEC02.1, it is of course difficult to predict how specific amino 
acids are oriented near the chromophore, especially given that we expect substantial distortion of the 
protein (relative to mCherry [24]) in the vicinity of the circular permutation site. Fortunately, the X-ray 
structures of R-GECOl and RCaMP are available to aid us in our effort to interpret our results. 

2+ 

We propose that, in the Ca free state, the side chain of Gin 163 is directed outside the barrel 
similar to the orientation of Lysl63 in R-GECOl (Figure IB). In this orientation the chromophore is 
less sterically constrained by the surrounding protein environment and thus has more rotational and 

2+ 

vibrational freedom, which explains the low quantum yield in the absence of Ca . When CaM and 

2+ 

Ml 3 interact due to Ca binding, the side chain of Gin 163 moves inwards and interacts with the 
chromophore in a conformation similar to that observed in mCherry (Figure 5C). That is, the side 
chain of Glnl63 is likely oriented below the chromophore and sandwiched between Trpl43 and 
He 161. Presumably, this interaction leads to a stabilization of the chromophore in a planar 
conformation that increases the quantum yield. This proposed mechanism is consistent with the results 
obtained with the Glnl63Met, Glnl63Asp, and Glnl63Lys variants. Mutation to the hydrophobic Met 
produces a functional protein, since the side chain is still able to undergo the Ca 2+ -dependent switch 
from the exterior to the interior of the protein. In contrast, mutation of Gin 163 to either Lys or Asp 
dramatically increases the energetic cost of bringing the charged side chain into the hydrophobic 
interior, and the Ca 2+ sensing mechanism is abolished. 

Disappointingly, we were not able to unambiguously identify key residues the contributed to the 
much higher affinity for Ca 2+ of CH-GEC02.1 relative to R-GECOl, nor we were able to identify the 
key residue that is undergoing an ionization with a pK a -6.5 to 7.0. We suspect that, in both cases, the 
relevant residues are located in the interface between the CaM domain and the FP domain. 

4. Conclusions 

The goal of this work was to gain insight into the mechanistic details of the recently reported 
genetically encoded Ca 2+ indicator CH-GEC02.1. While we were not able to answer all of the 
questions that naturally emerged from this effort, we were successful in identifying Gin 163 as the most 
important residue for transducing the conformational change associated with Ca 2+ binding into a 
change in the fluorescence intensity of the FP domain. We expect that this insight will be critical in 

9-i- 

guiding future efforts to further optimize genetically encoded Ca indicators for improved response. 
Another important conclusion from this work is that interdomain connections and interactions can 
have a complex influence on multiple aspects of a genetically encoded biosensor. Specifically, we 
observed that changes in linker composition and length could effect protein maturation and the sensing 
mechanism. In addition, our results suggest that interfacial residues play a crucial role in determining 

2+ 

the affinity of the indicator for Ca . We recommend that future efforts to optimize such indicators 
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should always involve substantial screening of linker libraries that vary in both length and 
composition. Also, we suggest that, given the complexity of the sensing mechanisms, directed 
evolution efforts that involve screening of libraries of randomly generated variants will tend to be 
much more effective than efforts guided by naive mechanistic hypotheses. 
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